ATARI environments - part1 #277

JulienT01 · 2023-02-15T10:37:16Z

PART1.
Add and update code to train on atari games (DQN and PPO on ALE/Breakout and ALE/Freewy):

Udate 'agents/torch/utils/models.py' to have better management of the size to connect the CNN and the head (MLP). And to manage the "multi-batch" dimensions. Atari wrapper give input by batch (for the dynamics), and DQN use chunk, so we have to merge this 2 kind of batch in 1, make the forward, then split the result in the previous batches
update 'agents/torch/utils/training.py' to manage the 'transpose_obs' parameters in automatic model_config (and don't overwritte previous settings)

TODO part 2 (Other PR) : #285

update atari_make (in gym_make.py) and 'DQN agent' to generate and manage vectorized environments.
update A2C and PPO agent to manage the vectorized environments

…i (NON-vectorized env only)

for more information, see https://pre-commit.ci

restart test pipeline...

for more information, see https://pre-commit.ci

…ffer

for more information, see https://pre-commit.ci

…nto xfail_tests_mac_windows

…O_buffer

…_part1

for more information, see https://pre-commit.ci

…_part1

mmcenta

Just one note: I need help understanding what the ScalarizeEnvWrapper is doing here. It feels like it really only should be used when n_envs = 1; otherwise, you're giving the same action to potentially different environments, right?

Otherwise, LGTM.

mmcenta · 2023-04-07T14:15:51Z

rlberry/agents/torch/utils/models.py

    def convolutions(self, x):
        x = x.float()
-        if len(x.shape) == 3:
+        if (


maybe move this comment to the line above so black doesn't split this into three lines

mmcenta · 2023-04-07T14:18:43Z

rlberry/envs/gym_make.py

+    scalarize = True
+
+    if "atari_wrappers_dict" in kwargs.keys():
+        atari_wrappers_dict = kwargs["atari_wrappers_dict"]


You can just do atari_wrappers_dict = kwargs.pop('atari_wrappers_dict') for the same effect.

mmcenta · 2023-04-07T14:19:45Z

rlberry/envs/gym_make.py

@@ -32,14 +33,59 @@ def gym_make(id, wrap_spaces=False, **kwargs):
    return Wrapper(env, wrap_spaces=wrap_spaces)


-def atari_make(id, scalarize=True, **kwargs):
+def atari_make(id, scalarize=None, **kwargs):


Why True to None? Shouldn't it be False?

for more information, see https://pre-commit.ci

TimotheeMathieu

A small comment, otherwise LGTM.

TimotheeMathieu · 2023-04-12T08:39:07Z

rlberry/envs/gym_make.py

@@ -32,14 +33,58 @@ def gym_make(id, wrap_spaces=False, **kwargs):
    return Wrapper(env, wrap_spaces=wrap_spaces)


-def atari_make(id, scalarize=True, **kwargs):
+def atari_make(id, scalarize=False, **kwargs):


A small docstring?

…_part1

for more information, see https://pre-commit.ci

JulienT01 added 6 commits February 15, 2023 10:56

add requirements to use atari game

3e874be

update atari_make, and the wrapper scalarize, to manage env from atar…

74686f6

…i (NON-vectorized env only)

update training and models to manage cnn (mandatory for atari games)

b6739d5

add tests on atari games (test the cnn part in dqn and ppo too)

32a5703

add example with video for the documentation

54fc405

black

469a86c

JulienT01 added documentation Improvements or additions to documentation enhancement New feature or request dependencies Pull requests that update a dependency file ready for review labels Feb 15, 2023

JulienT01 added 2 commits February 15, 2023 14:22

black

92286b9

update setup.py

ae0ed61

JulienT01 requested review from TimotheeMathieu, AleShi94, KohlerHECTOR and mmcenta February 15, 2023 14:01

JulienT01 and others added 14 commits February 16, 2023 14:42

add pytest-xprocess to run test_server.py

b701b29

add configfiles to .gitignore

ae587f7

change to fixed version image azure

dac452f

Merge branch 'rlberry-py:main' into Atari_part1

c9c0248

remove accelerate

9cc2efe

Update README.md

8206d53

Merge remote-tracking branch 'origin/main' into Atari_part1

daae996

temporary correction until main branch update

a21f5cd

Merge remote-tracking branch 'origin/main' into Atari_part1

3d242ae

xfail on tests that failed on Mac and windows

91ea0d1

[pre-commit.ci] auto fixes from pre-commit.com hooks

40ddbab

for more information, see https://pre-commit.ci

Update README.md

314f532

restart test pipeline...

optuna more graceful cleaning

7b3a692

shutils rmtree to os.rmdir

dbe3e33

JulienT01 and others added 20 commits April 3, 2023 17:13

use temporary folder instead

35aa58b

generalize PPO tests to 'check_agent'

02e69b1

[pre-commit.ci] auto fixes from pre-commit.com hooks

9073353

for more information, see https://pre-commit.ci

flake

799ceda

Merge branch 'PPO_buffer' of github.com:JulienT01/rlberry into PPO_bu…

9af6369

…ffer

patch : stableBaselines don't have get_params()

a49cbab

[pre-commit.ci] auto fixes from pre-commit.com hooks

4377431

for more information, see https://pre-commit.ci

Empty-Commit

ef0d917

update doc

e8f0b29

Empty-Commit

3789caf

Merge branch 'fix_ci' of https://github.com/TimotheeMathieu/rlberry i…

d6a63b7

…nto xfail_tests_mac_windows

don't remove PyOpenGL_accelerate

d310dd2

Merge remote-tracking branch 'origin/xfail_tests_mac_windows' into PP…

90faf20

…O_buffer

Merge remote-tracking branch 'origin/PPO_buffer' into Atari_part1

3993823

Merge branch 'main' into Atari_part1

9f5f217

add tests for atari empty input dim

03afc5c

Merge branch 'Atari_part1' of github.com:JulienT01/rlberry into Atari…

5ae295b

…_part1

[pre-commit.ci] auto fixes from pre-commit.com hooks

d321a99

for more information, see https://pre-commit.ci

remove test (already exist in "check_agent.py")

fad2801

Merge branch 'Atari_part1' of github.com:JulienT01/rlberry into Atari…

80f1220

…_part1

mmcenta approved these changes Apr 7, 2023

View reviewed changes

JulienT01 and others added 2 commits April 11, 2023 09:10

updades following Matheus review

0c26794

[pre-commit.ci] auto fixes from pre-commit.com hooks

46ad222

for more information, see https://pre-commit.ci

TimotheeMathieu approved these changes Apr 12, 2023

View reviewed changes

JulienT01 and others added 5 commits April 12, 2023 11:08

add docstring for atari_make

0cbd974

Merge branch 'Atari_part1' of github.com:JulienT01/rlberry into Atari…

6242b5a

…_part1

[pre-commit.ci] auto fixes from pre-commit.com hooks

5439abf

for more information, see https://pre-commit.ci

Merge branch 'rlberry-py:main' into Atari_part1

47e7c09

update changelog

ca981ac

JulienT01 merged commit 5031054 into rlberry-py:main Apr 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ATARI environments - part1 #277

ATARI environments - part1 #277

JulienT01 commented Feb 15, 2023 •

edited

Loading

mmcenta left a comment

mmcenta Apr 7, 2023

mmcenta Apr 7, 2023

mmcenta Apr 7, 2023

TimotheeMathieu left a comment

TimotheeMathieu Apr 12, 2023

ATARI environments - part1 #277

ATARI environments - part1 #277

Conversation

JulienT01 commented Feb 15, 2023 • edited Loading

mmcenta left a comment

Choose a reason for hiding this comment

mmcenta Apr 7, 2023

Choose a reason for hiding this comment

mmcenta Apr 7, 2023

Choose a reason for hiding this comment

mmcenta Apr 7, 2023

Choose a reason for hiding this comment

TimotheeMathieu left a comment

Choose a reason for hiding this comment

TimotheeMathieu Apr 12, 2023

Choose a reason for hiding this comment

JulienT01 commented Feb 15, 2023 •

edited

Loading